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While it is empirically accepted that the fixed-node diffusion Monte-Carlo (FN-DMC) depends 
only weakly on the size (beyond a certain reasonable level) of the one-particle basis sets used to 
expand its guiding functions, limits of this observation are not settled yet. Our recent work indicates 
that under the FN error cancellation conditions, augmented triple zeta basis sets are sufficient to 
achieve high-quality benchmark single-point energy differences in a number of small noncovalent 
complexes. In this preliminary progress report, we report on a possibility of significant truncation 
of the one-particle basis sets used to express the FN-DMC guiding functions, that has no visible 
effect on the accuracy of the production energy differences. The proposed scheme shows negligible 
increase of the local energy variance, indicating that the total CPU cost of large-scale benchmark 
noncovalent interaction energy FN-DMC calculations employing Gaussians may be reduced. 


In the domain of benchmark ab-initio noncovalent in¬ 
teraction energy calculations, fixed-node diffusion Monte 
Carlo (FN-DMC) method provides a promising alter¬ 
native to the commonly used coupled-cluster (CC) ap¬ 
proaches like CCSD(T)i. Accurate FN-DMC interac¬ 
tion energies (to 0.1 kcal/mol vs. CCSD(T)/CBS) have 
been recently reported on a number of small/medium 
noncovalent closed-shell complexes^— . In addition to 
accuracy, FN-DMC is attractive also for its favorable 
low-order polynomial CPU cost scaling^— and favor¬ 
able FN error cancellation^— that enabled its use in 
medium/large complexes^ ' 8 1 9 1 1 6 ~ — where CC methods 
were intractable (in original formulation and reasonable 
basis set) until recently— Furthermore, direct treatment 
of cxtended2i22r— and/or multireference systems^ makes 
FN-DMC an attractive many-body method worth of fur¬ 
ther research and development. 

While it is empirically accepted that the FN-DMC re¬ 
sults depend only weakly on the one-particle basis setsi 
used to expand the FN-DMC guiding functions, limits 
of this assumption in energy differences remain unclear. 
Our recent work e.g. indicates that in FN-DMC calcula¬ 
tions using DFT-based single-determinant guiding func¬ 
tions, the basis set cardinality is not as important as the 
presence of augmentation functions— An example of 
ammonia dimer complex well illustrates this behavior: a 
sequence of VTZ, VQZ and aug-VTZ basis sets gener¬ 
ates the interaction energies of -3.33±0.07, -3.47±0.07 
and -3.10±0.06 kcal/moli^, while the complete-basis- 
set(CBS) CCSD(T) benchmark value at the same geom¬ 
etry amounts to -3.15 kcal/mo l 30 ’ 31 . Augmented triple 
zeta basis sets were confirmed to be sufficient to achieve 
a level of 0.1 kcal/mol in the final FN-DMC interac¬ 
tion energies in numerous case s 7 ' 15 . Some residual errors 
that remain in certain types of noncovalent interactions 
(e.g. stacking or hydrogen bonds combined with multi¬ 
ple bonding) are not yet understood and require further 
attention— 

QMC is nevertheless a very promising methodology, 


and its main limitation in area of noncovalent interac¬ 
tions is the CPU cost that stems from the tight statistical 
convergence required in case of small energy differences. 
It is therefore important to map out strategies of possi¬ 
ble CPU cost reduction. For instance, presence of the FN 
error cancellatio n 1 ' 7 ' 13 ' 14 ' 32 enabled use of economic Jas- 
trow factor with electron-electron and electron-nucleus 
terms instead of the more demanding one including also 
electron-electron-nucleus terms— 

Here we report on a possibility of truncation of the one- 
particle Gaussian basis sets used to expand the orbitals 
in the single Slater determinant FN-DMC guiding func¬ 
tions without affecting the accuracy of the final energy 
differences. A series of tests 7 led us to the finding that 
high angular momentum basis functions are not critically 
importan t 14 ' 33 and the augmentation basis set is neces¬ 
sary, but the acceptable accuracy is achieved already with 
a single diffuse s function per atom used instead of the 
common aug- set of basis functions 7 . 

In the following, we compare the two types of basis 
sets used to express the orbitals in single-determinant 
FN-DMC guiding functions. The results obtained 
with trimmed basis set (e.g. [3s3p2d]+[ls] on car¬ 

bon atom) denoted as s-rVTZ are compared to the 
ones obtained with the standard aug-VTZ basis set 
([3s3p2dlf] + [lslpldlf] on carbon atom). Surprisingly, 
the (single-point) interaction energies produced with the 
trimmed bases reveal no statistically significant differ¬ 
ences with respect to the reference calculations, and, no 
significant increase of the local energy variance, implying 
that the equivalent-quality FN-DMC results are available 
at costs lower than assumed to date. 

The considered test set contains seven noncovalent 
closed-shell complexes (with geometries obtained) from 
the A24 database— ammonia dimer (AM...AM), water- 
ammonia (WA...AM), water dimer (WA...WA), am¬ 
monia methane complex (AM...MT), methane dimer 
(MT...MT), hydrogen fluoride dimer (HF...HF), and 
HCN dimer (HCN...HCN). The atomic cores in these 
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complexes were replaced by the effective core poten¬ 
tials (ECPs) developed by Burkatzki et al^ 7 . The 
ECPs were used in combination with Dunning-type aug- 
VTZ basis sets or their truncated counterparts (s-rVTZ, 
see above). Single-determinant Slater-JastrowiZ trial 
wave functions were constructed using orbitals from 
B3LYP (GAMESS^ code) and the Schmidt-Moskowita^Z 
isotropic Jastrow factorsZZ including electron-electron 
and electron-nucleus termsZZ were expanded in a fixed 
basis set of polynomial Pade functions^. The Jastrow 
parameters were refined by the Hessian driven varia¬ 
tional Monte Carlo optimization using at least 10x10 it¬ 
erations and linear combination^ of energy (95%) and 
variance (5%) as a cost function. The orbital parameters 
were frozen to keep the nodes of the guiding functions 
intact. The production FN-DMC runs used time step 
of 0.005 a.u and the T-moves scheme for the treatment 
of ECPs^Z. The target walker populations in FN-DMC 
amounted to 16-32k. All QMC calculations were per¬ 
formed using the code QWaHs^S. 

The FN-DMC results obtained with single¬ 
determinant Slater-Jastrow guiding functions expanded 
in aug-VTZ and reduced s-rVTZ basis sets are reported 
in Table [I] The interaction energies obtained with 
s-rVTZ basis are clearly compatible with the aug-VTZ 
data in all considered cases (Figure 1), that is a non¬ 
trivial and very interesting observation, since, in the 
trimmed case, the total number of basis functions is 
reduced by more than two times, and, the local energy 
variance increases only by up to 4% (Table HJ. 

Why is this the case? As mentioned above, VTZ or 
even VQZ bases do not lead to the correct energy differ¬ 
ences in the similar setting, so the non-augmented rVTZ 
basis sets could hardly lead to the more accurate results. 
We thus presumably attribute the observed agreement 
of s-rVTZ and aug-VTZ results to the similar degree of 
FN error cancellation and sufficient sampling of intersti¬ 
tial regions secured by the single s function per atom, 
working even in such a complex like HF dimer 7 . The 
modest increase of a 2 is attributed to the use of identical 
occupied-shell contractions. 

Let us discuss the CPU cost benefits of the proposed 
basis-set size trimming. The Slater matrix evaluation 
step scales as 0(MN 3 ) where M is the number of basis 
functions and N is the number of electrons. The ideal 
speedup with reduced M for asymptotically large fixed 
N is proportional to 


°fef M ref 

°trim-^trim 


( 1 ) 


where M le f is the number of basis functions in the refer¬ 
ence basis set (aug-VTZ), M tr ; m is the count of the basis 
functions in the trimmed case (s-rVTZ), and er 2 s denote 
the respective local energy variances. As the formula 
(1) indicates, the cost savings start to become interest¬ 
ing for M tr i m significantly smaller than M re f and only 
if the ratio of local energy variances of ef /of rim does not 
outweight the gain from M re f/M tr i m . Furthermore, the 



Figure 1. Demonstration of the agreement between the FN- 
DMC interaction energies A E (keal/mol) from Table [I] cal¬ 
culated with aug-VTZ and truncated s-rVTZ basis sets. The 
error bars (not shown) are smaller than the symbol size. 


observed speedup is smaller than the theoretical limit Sj, 
since, other routines, like evaluation of Jastrow factor, 
pseudopotentials and/or distance matrix updates, also 
add to the overall cost. The observed speedup thus ap¬ 
proaches Si only for sufficiently large M where the Slater 
matrix updates dominate the overall CPU cost. Even 
in the theoretical limit, the CPU cost gain grows lin¬ 
early that may appear unimportant. Nevertheless, the 
FN-DMC calculations are very expensive in general, and 
frequently millions of CPU-hours are invested in valu¬ 
able projects (e.g. Ref. ©■ Therefore, in our opinion, 
any CPU cost reduction is important, and the scheme 
presented here (i.e. use of s-rVTZ instead of aug-VTZ, 
in combination with Jastrow factor containing electron- 
electron and electron-nucleus terms) asymptotically of¬ 
fers about two-fold speedup (cf. Table Q] and examples 
below). 

The following examples illustrate the wave function 
evaluation speedups (compare to M re f/M tr i m ) in pure 
Slater and Slater-Jastrow runs: 

i) In the HCN dimer (20 electrons) where 
M re f/Mt r i m = 2.02, the practical Slater evaluations 
with s-rVTZ are 1.5 times faster than in the case 
of aug-VTZ and the Slater-Jastrow run achieves an 
evaluation cost speedup of 1.22, rather far from the 
expected value. This indicates that the system is “small” 
and routines other than the Slater matrix value and 
Laplacian updates are still important. 

ii) In the case of a larger complex, coronene...H 2 with 
110 electrons (M re f/M tr i m = 1402/692 = 2.03), the 
speedup of a pure Slater run is 1.92-fold while the FN- 
DMC calculation using Slater-Jastrow guiding function 
achieves a speedup factor of 1.74. Note that the local 
energy variance ratio of ef /of rim = 2.590/2.707 = 0.96 so 
the total observed speedup of the calculation achieves a 
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Table I. Comparison of interaction energies A E (kcal/mol) and local energy variances in dimers cr 2 (a.u. 2 ) from FN-DMC 
calculations using aug-VTZ (AEs taken from Ref. [lS) and trimmed, s-rVTZ, basis sets, the related counts of the basis functions 
M in the dimer of the given complex, and, ideal expected speedup ( Si , see text) of s-rVTZ vs. aug-VTZ calculation. 


Complex 

aug-VTZ 

A E a 2 M 

s-rVTZ 

A E a 2 M 

Si 

AM...AM 

-3.30±0.04 0.433 228 

-3.36i0.08 0.445 106 

2.09 

AM...WA 

-6.71i0.07 0.561 205 

-6.64i0.09 0.578 96 

2.07 

WA...WA 

-5.30i0.05 0.667 182 

-5.25i0.09 0.682 86 

2.07 

HF...HF 

-4.89i0.05 0.960 136 

-4.89i0.10 0.968 66 

2.04 

AM...MT 

-0.83i0.06 0.360 251 

-0.77i0.07 0.364 106 

2.34 

MT...MT 

-0.63i0.03 0.271 274 

-0.65i0.04 0.282 126 

2.09 

HCN...HCN 

-5.09i0.08 0.582 226 

-4.97i0.08 0.605 112 

1.94 


factor of 1.67 (sj=1.94). In larger systems, the speedups 
clearly grow toward the theoretical limit and become in¬ 
teresting. 

In conclusion, a new one-particle basis set truncation 
scheme has been shown to considerably reduce compu¬ 
tational requirements while retaining a full accuracy in 
energy differences of seven noncovalent complexes. This 
finding presumably extends the range of applicability for 
FN-DMC method in area of noncovalent interactions. 
Further tests including extended sets of molecules, other 
truncation schemes, and various combinations of tuned 
bases with various Jastrows are now underway 7 . 
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